508 research outputs found
Recommended from our members
A survey of induction algorithms for machine learning
Central to all systems for machine learning from examples is an induction algorithm. The purpose of the algorithm is to generalize from a finite set of training examples a description consistent with the examples seen, and, hopefully, with the potentially infinite set of examples not seen. This paper surveys four machine learning induction algorithms. The knowledge representation schemes and a PDL description of algorithm control are emphasized. System characteristics that are peculiar to a domain of application are de-emphasized. Finally, a comparative summary of the learning algorithms is presented
Recommended from our members
Episodic learning
A system is described which learns to compose sequences of operators into episodes for problem solving. The system incrementally learns when and why operators are applied. Episodes are segmented so that they are generalizable and reusable. The idea of augmenting the instance language with higher level concepts is introduced. The technique of perturbation is described for discovering the essential features for a rule with minimal teacher guidance. The approach is applied to the domain of solving simultaneous linear equations
Machine learning research 1989-90
Multifunctional knowledge bases offer a significant advance in artificial intelligence because they can support numerous expert tasks within a domain. As a result they amortize the costs of building a knowledge base over multiple expert systems and they reduce the brittleness of each system. Due to the inevitable size and complexity of multifunctional knowledge bases, their construction and maintenance require knowledge engineering and acquisition tools that can automatically identify interactions between new and existing knowledge. Furthermore, their use requires software for accessing those portions of the knowledge base that coherently answer questions. Considerable progress was made in developing software for building and accessing multifunctional knowledge bases. A language was developed for representing knowledge, along with software tools for editing and displaying knowledge, a machine learning program for integrating new information into existing knowledge, and a question answering system for accessing the knowledge base
Recommended from our members
Learning Problem Solving
Learning to problem solve requires acquiring multiple forms of knowledge. Problem solving is viewed as a search of a state-space formulation of a problem. With this formalism, operators are applied to states to transit from the intiial state to the goal state. The learning task is to acquire knowledge of that state-space to guide search. In particular, three forms of knowledge are required: why each operator is useful, when to apply each operator, and what each operator does. A PROLOG implementation, named PET, demonstrates the learning approach in the domains of simultaneous linear equations and symbolic integration.Episodic learning is a technique for learning why individual operators are useful in a solution path. Episodic learning acquires generalized operator sequences which achieve the goal state. This is done by backing-up state evaluation and learning sub-goals in the state-space.Perturbation is a technique for learning when individual operators are useful. Perturbation guides the generalization process to discover minimally-constained preconditions for useful operator applications. This is done by experimentation, thereby reducing the teacher's role in the learning process.Learning relational models is a technique for discovering what individual operators do. Relational models are an explicit representation of the transformation performed by operators. This representation enables the learning element to reason with operator semantics to guide further learning.Episodic learning, perturbation and relational models form an integrated approach for learning problem solving. The approach demonstrates self-teaching by reasoned experimentation
ADJOINT-ASSISTED INVERSION FOR SHALLOW WATER ENVIRONMENT PARAMETERS
The adjoint of a forward model can back-propagate mismatch between observations and their predictions and produce the corrections to the forward model inputs that caused the mismatch. As an example of this process, the adjoint of a parabolic equation propagation model is used to invert errors in pressure predictions at a receiver for sound speed perturbations due to internal tides
Recommended from our members
EM-mosaic detects mosaic point mutations that contribute to congenital heart disease.
BackgroundThe contribution of somatic mosaicism, or genetic mutations arising after oocyte fertilization, to congenital heart disease (CHD) is not well understood. Further, the relationship between mosaicism in blood and cardiovascular tissue has not been determined.MethodsWe developed a new computational method, EM-mosaic (Expectation-Maximization-based detection of mosaicism), to analyze mosaicism in exome sequences derived primarily from blood DNA of 2530 CHD proband-parent trios. To optimize this method, we measured mosaic detection power as a function of sequencing depth. In parallel, we analyzed our cohort using MosaicHunter, a Bayesian genotyping algorithm-based mosaic detection tool, and compared the two methods. The accuracy of these mosaic variant detection algorithms was assessed using an independent resequencing method. We then applied both methods to detect mosaicism in cardiac tissue-derived exome sequences of 66 participants for which matched blood and heart tissue was available.ResultsEM-mosaic detected 326 mosaic mutations in blood and/or cardiac tissue DNA. Of the 309 detected in blood DNA, 85/97 (88%) tested were independently confirmed, while 7/17 (41%) candidates of 17 detected in cardiac tissue were confirmed. MosaicHunter detected an additional 64 mosaics, of which 23/46 (50%) among 58 candidates from blood and 4/6 (67%) of 6 candidates from cardiac tissue confirmed. Twenty-five mosaic variants altered CHD-risk genes, affecting 1% of our cohort. Of these 25, 22/22 candidates tested were confirmed. Variants predicted as damaging had higher variant allele fraction than benign variants, suggesting a role in CHD. The estimated true frequency of mosaic variants above 10% mosaicism was 0.14/person in blood and 0.21/person in cardiac tissue. Analysis of 66 individuals with matched cardiac tissue available revealed both tissue-specific and shared mosaicism, with shared mosaics generally having higher allele fraction.ConclusionsWe estimate that ~ 1% of CHD probands have a mosaic variant detectable in blood that could contribute to cardiac malformations, particularly those damaging variants with relatively higher allele fraction. Although blood is a readily available DNA source, cardiac tissues analyzed contributed ~ 5% of somatic mosaic variants identified, indicating the value of tissue mosaicism analyses
- …